Mixture Component Clustering for Efficient Speaker Verification

نویسندگان

  • Richard D. McClanahan
  • Phillip L. De Leon
چکیده

In speaker verification (SV) systems based on a support vector machine (SVM) using Gaussian mixture model (GMM) supervectors, a large portion of the test-stage computational load is the calculation of the a posteriori probabilities of the feature vectors for the given universal background model (UBM). Furthermore, the calculation of the sufficient statistics for the mean also contributes substantially to computational load. In this paper, we propose several methods to cluster the GMMUBM mixture components in order to reduce the computational load and speed up the verification. In the adaptation stage, we compare the feature vectors to the clusters and calculate the a posteriori probabilities and update the statistics exclusively for mixture components belonging to appropriate clusters. Our results, demomstrate that (on average) we can, reduce the number of a posteriori probability calculations by a factor up to 2.8× without loss in accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting GMM-based Quality Measure for SVM Speaker Verification

In this paper, we examine the problem of quality measurement for speaker verification using support vector machines (SVMs). An efficient Gaussian mixture models (GMMs) based quality estimation algorithm is proposed to potentially utilize speaker-specific broad acoustic-class characteristics. Some verification strategies are also considered in the test phase. We perform clustering-based vector p...

متن کامل

Compute Efficient Training Method for Gaussian Mixture Model Based Speaker Verification

Speaker Verification is a memory and compute intensive process, giving rise to area and latency concerns in the way of its System-On-a-Chip implementation. The training schemes for computing the speaker models contribute significantly to the overall complexity in the implementation of the system. In this paper, we demonstrate that K-Means algorithm can be used to realize compute efficient train...

متن کامل

Efficient text-independent speaker verification with structural Gaussian mixture models and neural network

We present an integrated system with structural Gaussian mixture models (SGMMs) and a neural network for purposes of achieving both computational efficiency and high accuracy in text-independent speaker verification. A structural background model (SBM) is constructed first by hierarchically clustering all Gaussian mixture components in a universal background model (UBM). In this way the acousti...

متن کامل

Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model

We have proposed a novel speaker clustering method based on a hierarchically structured utterance-oriented Dirichlet process mixture model. In the proposed method, the number of speakers can be determined from the given data using a nonparametric Bayesian manner and intra-speaker variability is successfully handled by multi-scale mixture modeling. Experimental result showed that the proposed me...

متن کامل

Efficient Text-Independent Speaker Identification using Optimized Hierarchical Mixture Clustering

Conventional Speaker Identification(SI) Systems uses individual Gaussian Mixture Models(GMM) for every speaker. If this method used for the large population Speaker identification systems, then during identification, likelihood computations between an unknown speaker's test feature vectors and speaker models has become a time-consuming process. This approach also increases the computationa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012